BoosTexter: A Boosting-based System for Text Categorization

----------Robert E. Schapire
----------Yoram Singer
Abstract: This work focuses on algorithms which learn from examples to perform multiclass text and speech categorization tasks. Our approach is based on a new and improved family of boosting algorithms. We describe in detail an implementation, called BoosTexter, of the new boosting algorithms for text categorization tasks. We present results comparing the performance of BoosTexter and a number of other text-categorization algorithms on a variety of tasks. We conclude by describing the application of our system to automatic call-type identification from unconstrained spoken customer responses.
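Let us now unpack this abstract. The first sentence states what the method is for: multiclass text categorization and speech categorization. The next sentence introduces the approach itself, which is derived from improvements to boosting algorithms. The abstract then presents a system, BoosTexter, built from this family of (four) methods, together with its use on text categorization tasks. The closing part of the paper likewise describes a practical application: a real-world speech categorization task, namely identifying call types from spoken customer responses.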

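The four methods introduced in the paper are quite distinctive. Although they are all boosting methods, the weight-update equations have been improved: a Hamming measure and a ranking method are used, respectively, to set up the threshold function, and this leads to methods aimed at four types of application.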
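To make the difference between the Hamming measure and the ranking criterion concrete, here is a minimal Python sketch. It is not the paper's boosting algorithm; it only evaluates two loss measures on already-computed label scores. The function names, the toy labels and scores, and the convention that a positive score means "label predicted" are all assumptions made for illustration.

```python
from typing import Dict, List, Set

def hamming_loss(scores: List[Dict[str, float]],
                 truths: List[Set[str]],
                 labels: List[str]) -> float:
    """Fraction of (example, label) pairs whose predicted membership is wrong.

    Assumption: a label counts as predicted when its score is positive.
    """
    errors = 0
    for f, y in zip(scores, truths):
        for label in labels:
            predicted = f[label] > 0
            actual = label in y
            errors += int(predicted != actual)
    return errors / (len(scores) * len(labels))

def ranking_loss(scores: List[Dict[str, float]],
                 truths: List[Set[str]],
                 labels: List[str]) -> float:
    """Average fraction of (correct, wrong) label pairs that are misordered,
    i.e. the wrong label scores at least as high as the correct one.

    Examples with no correct label or no wrong label are skipped.
    """
    total = 0.0
    counted = 0
    for f, y in zip(scores, truths):
        correct = [l for l in labels if l in y]
        wrong = [l for l in labels if l not in y]
        if not correct or not wrong:
            continue
        misordered = sum(1 for c in correct for w in wrong if f[w] >= f[c])
        total += misordered / (len(correct) * len(wrong))
        counted += 1
    return total / counted if counted else 0.0

# Hypothetical toy data: two documents, three possible labels.
LABELS = ["sports", "politics", "tech"]
SCORES = [
    {"sports": 1.2, "politics": -0.4, "tech": 0.3},
    {"sports": -0.5, "politics": 0.8, "tech": -0.1},
]
TRUTHS = [{"sports"}, {"politics", "tech"}]

h = hamming_loss(SCORES, TRUTHS, LABELS)   # 2 wrong decisions out of 6
r = ranking_loss(SCORES, TRUTHS, LABELS)   # every correct label outranks every wrong one
```

On this toy data the Hamming loss is 2/6, because "tech" is wrongly predicted for the first document and wrongly missed for the second, while the ranking loss is 0, because in both documents every correct label still scores above every incorrect one. This illustrates why the two criteria can favor different classifiers: one judges each yes/no label decision, the other judges only the relative order of the labels.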